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UBIQUITIN-BASKn s plit PROTEIN SENSOR 

Government Support 

Work described herein was supported by grants from 
the United states government and the government has 
rights to the invention described herein. 

Background of the Invention 

Multiprotein complexes mediate the bulk of 
biological processes. A crucial part of our 
understanding of these processes is therefore based on 
knowing which proteins interact with a protein of 
interest. This knowledge is extensive for oligomeric 
proteins whose subunit interactions are strong enough to 
withstand in vitro conditions. However, many oligomeric 
complexes, while relevant physiologically, are either 
transient, intrinsically unstable, or are destabilized 
upon dilution, depletion of cof actors and other 
perturbations that accompany a transition from in vivo 
to in vitro conditions. m part as a result of this 
difficulty, the existing knowledge encompasses but a 
small fraction of the actually occurring, 
Physiologically relevant protein-protein interactions 
even among the best understood organisms. Moreover, 
even for proteins whose in vivo protein ligands are 
partly known, this knowledge is often of a qualitative 
kind; it rarely includes the actual affinities, let 
alone kinetic aspects of an in vivo interaction. 
Limitations of the existing in vivo methods are a major 
reason for this impasse. 

Assays for in vivo protein interactions include 
crossl inking of interacting proteins with a 
cell-penetrating agent such as formaldehyde, and use of 
fluorescence resonance energy transfer to follow the 
interactions of dye-coupled proteins microinjected into 
living cells. Genetic analyses of in vivo protein 
interactions include searches for extragenic suppressor 
mutations or synthetic lethal mutations, which occur in 
genes whose products are at least functionally (and 
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often physically) associated with a gene product of 
interest. A more recent approach, the two-hybrid 
technique, is based on expressing one test protein as a 
fusion to a DNA-binding domain of a transcriptional 
activator, and expressing another test protein as a 
fusion to a transcriptional activation domain. If the 
test proteins interact in vivo, a transcriptional 
activator is reconstituted, resulting in the induction 
of a reporter gene. 

The repertoire of existing assays for in vivo 
protein interactions, while expanding, is deficient in 
that significant questions cannot be answered due to 
technical limitations. New techniques which fill these 
experimental voids would represent important steps 
15 forward in the art. 

Summary of the Invention 

In one aspect, the present invention relates to 
compositions and methods useful for studying 
interactions between proteins. A composition of the 
20 invention is a fusion protein comprising an N-terminal 

subdomain of ubiquitin, fused to a non-ubiquitin protein 
or peptide. A second composition of the invention is a 
fusion protein comprising a C-terminal subdomain of 
ubiquitin, fused to the N-terminus of a non-ubiquitin 
25 protein or peptide. When contacted with one another, 
provided that the non-ubiquitin proteins or peptides 
interact (bind) with one another, the N- and C-terminal 
ubiquitin subdomains associate to reconstitute a quasi- 
native ubiquitin moiety which is recognized and cleaved 
30 by ubiquitin-specif ic proteases. As discussed in 

greater detail below, either the N-terminal subdomain of 
ubiquitin or the C-terminal subdomain of ubiquitin must 
be mutationally altered to reduce the ability of the 
ubiquitin subdomains to reconstitute a quasi-native 
3 5 ubiquitin moiety. 
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Other compositions of the present invention which 
relate to the aspect of the invention described above 
include DNA-based expression vectors encoding fusion 
proteins of the type described in the preceding 
paragraph- In addition, DNA-based expression vectors 
containing an expression cassette containing randomly 
cleaved DNA from an organism fused to either the N- or 
C-terminal ubiguitin subdomains are disclosed. Such 
constructs are useful for expression library screening. 

Compositions of the type described above are useful 
in methods for studying protein/protein interactions. 
Fusion proteins containing an N-terminal ubiguitin 
subdomain are contacted with fusion proteins containing 
a C-terminal ubiguitin subdomain. Provided that protein 
or peptide components other than the ubiguitin 
components interact with one another, the "effective" 
(local) concentration of the N- and C-terminal ubiguitin 
subdomains is increased, thereby promoting 
reconstitution of a quasi-native ubiguitin moiety, which 
is subsequently cleaved by ubiquitin-specif ic proteases. 
A number of assay formats, including in vivo and in 
vitro formats, are described herein. 

The first aspect of the invention discussed above 
is applicable to the identification of interacting 
protein or peptide pairs when one member of the 
specifically binding pair is known. In a second aspect, 
the invention relates to compositions and methods for 
studying the interaction of two members of a specific 
binding pair, both of which are predetermined. For 
example, the second aspect of the invention relates to 
the determination of a predetermined ligand in a sample. 
The second aspect of the invention is also applicable to 
the identification of an inhibitor of binding of an 
analyte (or, more generally, a ligand) to an affinity 
reagent. 
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The second aspect is also based on the use of 
ubiguitin fusion constructs wherein N— and C-terminal 
subdomains of ubiquitin are fused to first and second 
members of a specific binding pair, respectively. The 
specific binding of the members of the specific binding 
pair is detected by the activation of reporter following 
cleavage of the C-terminal ubiquitin fusion construct by 
a ubiquitin-specific protease. in practice, one member 
of the specific binding pair is selected to mimic the 
binding characteristics of a ligand to be determined. 
In the absence of a competitor molecule (e.g., a ligand) 
in the sample being analyzed, a baseline level of 
reporter activity is generated based on the binding 
interaction of the two members of the specific binding 
pair which results in cleavage by a ubiquitin-specific 
protease and activation of the reporter. However, the 
presence of ligand in the sample being analyzed acts as 
a competitive inhibitor of the binding between the two 
ubiquitin fusion constructs, thereby reducing the rate 
of ubiquitin-specific protease activity and, as a 
consequence, levels of reporter activity. 

Brief Descrip tion of the Drawing s 

Figure 1 is a diagram representing split ubiquitin 
as a proximity sensor in vivo. (A) a newly formed 
ubiquitin (Ub) moiety bearing an insertion (wavy thin 
line) between its N-terminal (semicircle denoted as 
" N ub"'*) and C-terminal (semicircle denoted as "c «;) 
"halves", and linked to a reporter protein (oval' denoted 
as "Re";). The insertion did not detectably interfere 
with the Ub folding, which was required for the rapid in 
vivo cleavage of the fusion by Ub-specific proteases 
(UBPs; lightning arrow), yielding the free reporter. 
(B) The result in A suggested that the N ub and c ub halves 
of Ub can be viewed as its distinct subdomains (Fig. 2) . 
When these subdomains were coexpressed as separate 
fragments, with C^ still linked to the reporter, 
significant in vivo reconstitution of a quasi-native 
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(recognizable by UBPs) Ub moiety was observed. (C) In 
vivo reconstitution of Ub from its separate, coexpressed 
fragments did not occur with an altered N ub fragment, 
denoted as N™ b (mutant) , that bore a single-residue ' 
5 replacement at position 13. Conformational 

destabilization of N™ b relative to its wild-type 
counterpart N ub is indicated by the altered shape of the 
N™ b subdomain. A cross over the arrow indicates the 
absence of Ub reconstitution. (D) Ubiquitin-based 
10 Split-protein sensor (USPS) . N * b , a mutational ly altered 
fragment of Ub that failed to reconstitute Ub in the 
presence of c ub< could still do so if the two Ub 
fragments were linked to polypeptides P, and P 2 (two 
ovals) that interacted in vivo. This interaction 
increased the effective (local) concentration of Ub 
fragments, and therefore increased the probability of N» b 
and C ub associating to form a quasi-native Ub moiety. 
This event was detected by the irreversible, diagnostic, 
UBP-mediated cleavage after the last residue of Ub in 
2 0 c ub , yielding the free reporter. Reduced conformational 
stability of Ub that has been reconstituted with N™ b 
instead of N^, is denoted by a gap between the 
"interacting" surfaces of Ub subdomains. 

Figure 2 is a ribbon diagram of Ub with the two 
subdomains identified in the present work, encompassing 
residues -1 to -37 and -38 to -76. A black triangle 
denotes the site of a 6S-residue insertion between the 
subdomains (Figs. 1A and 3-VI) . some of the residue 
numbers are indicated. He", the site of mutations 
analyzed in this work (Fig. 3), is in the second strand 
of the /3-sheet, where it interacts with the hydrophobic 
face of the a-helix. 

Figure 3 is a diagram showing the fusion constructs 
of the present invention. These fusions contained some 
of the following elements: (i) a Ub moiety, either 
wild-type (construct I) or bearing single-residue 
replacements at position 13 (constructs II-IV) , or at 
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positions 3 and 13 (construct V) . (ii) A Ub moiety 
containing the 68-residue insertion (denoted as "Ste6" 
in the diagram) derived from the cytosolic region of s. 
cerevisiae Ste6 between its transmembrane segments 4 and 
5 5. The insertion was positioned after residue 36 of Ub 
(construct VI). (iii) A Ub moiety bearing both the 
above insertion. and a single-residue replacement at 
position 13 (constructs VII-IX) . (iv) a C-terminal 
fragment of Ub (C^, residues 35-76) bearing the 
10 3 2 -residue, Ste6-derived sequence at its N-terminus 

(construct X) . (v) whose N-terminus was extended, 
via the linker sequence Gly-Glu-Ile-Ser-Thr (SEQ ID NO. 
5), with the 47-residue homodimerization motif ("leucine 
zipper", or z,) of S. cerevisiae Gcn4 (residues 235-281 
15 of Gcn4) (construct XV) . (vi) An N-terminal fragment of 
Ub (N^, residues 1-37) bearing the wild-type Ub sequence 
or a single-residue replacement at position 13, and a 
C-terminal extension containing the linker sequence 
Gly-Gly-Ser-Thr-Met (SEQ ID NO. 3) followed by the z, 
leucine zipper of Gcn4 (constructs XI-XIV) . (vii) Mouse 
dihydrofolate reductase (DHFR) bearing a C-terminal ha 
epitope (denoted as "DHFR" in the diagram and as "dha" 
in the text) . The explicitly indicated amino acid 
sequences are in single-letter abbreviations. All 
constructs were expressed from the induced promoter. 

Figure 4 is a diagrammatic representation of an 
assay for determining the interaction between an 
affinity reagent (AR) and a ligand (L) . 

Figure 5 is a diagrammatic representation of an 
assay for determining the interaction between a ligand 
(L) , a first affinity reagent (AR,)' and a second affinity 
reagent (AR 2 ) . 

Detailed Descripti on of the Invention 

Ubiquitin (Ub) is a 76-residue, single-domain 
protein (Fig. 2) whose covalent coupling to other 
proteins yields branched Ub-protein conjugates and plays 
a role in a number of cellular processes, primarily 
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through routes that involve protein degradation. Unlike 
the branched Ub conjugates, which are formed 
posttranslationally, linear Ub adducts are the 
translational products of natural or engineered Ub 
fusions. It has been shown that, in eukaryotes, newly 
formed Ub fusions are rapidly cleaved at the 
Ub-polypeptide junction by Ub-specif ic proteases (UBPs) . 
In the yeast Saccharomyces cerevislae, there are at 
least five species of UBP. Recent work has shown that 
the cleavage of a Ub fusion by UBPs requires the folded 
conformation of Ub, because little or no cleavage is 
observed with fusions whose Ub moiety was 
conformationally destabilized by single-residue 
replacements or a deletion distant from the site of 
15 cleavage by UBPs. 

The present invention is based on the discovery 
that a fusion protein comprising a ubiquitin subdomain 
is useful, for example, in a method for studying the 
interactions between members of a specific-binding pair. 
A "specific-binding pair", as used herein, refers to a 
pair of molecules which bind specifically to one another 
when incubated under physiological conditions. 
Typically, although not necessarily, both members of the 
specific binding pair are proteins and/ or peptides. For 
convenience, the term "protein(s) » will be used 
throughout the description of the present invention in 
the context of studying protein and /or peptide 
interaction as a shorthand expression for "protein (s) or 
peptide (s)". The two members of a specific-binding pair 
are often referred to as "ligand" and "affinity 
reagent", for example, in the case "of "antigen" and 
"antibody", respectively. These expressions are also 
employed herein. 

Briefly, it has been demonstrated that an N- 
terminal ubiquitin subdomain and a C-terminal ubiquitin 
subdomain, the latter bearing a reporter extension at 
its C-terminus, when coexpressed in the same cell by 
recombinant DNA techniques as distinct entities, have 
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the ability to associate, reconstituting a ubiguitin 
molecule which is recognized, and cleaved, by ubiguitin- 
specific processing proteases which are present in all 
eukaryotic cells. This reconstituted ubiguitin 
molecule, which is recognized by ubiguitin-specif ic 
proteases, is referred to herein as a guasi-native 
ubiquitin moiety. As disclosed herein, ubiguitin- 
specific proteases recognize the folded conformation of 
ubiguitin. Remarkably, ubiguitin-specif ic proteases 
retained their cleavage activity and specificity of 
recognition of the ubiguitin moiety that had been 
reconstituted from two unlinked ubiquitin subdomains. 

Ubiguitin is a 76-residue, single-domain protein 
comprising two subdomains which are relevant to the 
15 present invention - the N-terminal subdomain and the C- 
terminal subdomain. The ubiquitin protein has been 
studied extensively and the DNA sequence encoding 
ubiguitin has been published (Ozkaynak et al., EMBO J. 
61 1429 (1987)} (SEQ ID NO. 1). The N-terminal 
subdomain, as referred to herein, is that portion of the 
native ubiguitin molecule which folds into the only a- 
helix of ubiguitin interacting with two 0-strands. 
Generally speaking, this subdomain comprises amino acid 
residues from about residue number l to about residue 
2 5 number 3 6 . 

The C-terminal subdomain of ubiguitin, as referred 
to herein, is that portion of the ubiguitin which is not 
a portion of the N-terminal subdomain defined in the 
preceding paragraph. Generally speaking, this subdomain 
comprises amino acid residues from about 3 7 to about 76. 
It should be recognized that by using only routine 
experimentation it will be possible to define with 
precision the minimum requirements at both ends of the 
N-terminal subdomain and the C-terminal subdomain which 
are necessary to be useful in connection with the 
present invention. 
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In order to study the interaction between members 
of a specific-binding pair, one member of the pair is 
fused to the N-terminal subdomain of ubiquitin and the 
other member of the specific-binding pair is fused to 
the C-terminal subdomain of ubiquitin. Since the 
members of the specific-binding pair (linked to 
subdomains of ubiquitin) have an affinity for one 
another, this affinity increases the "effective" (local) 
concentration of the N-terminal and C-terminal 
subdomains of ubiquitin, thereby promoting the 
reconstitution of a quasi-native ubiquitin moiety. For 
convenience, the term "quasi-native ubiquitin moiety" 
will be used herein to denote a moiety recognizable as a 
substrate by ubiquit in-specific proteases. m light of 
the fact that the N-terminal and C-terminal subdomains 
of ubiquitin associate to form a quasi-native ubiquitin 
moxety even in the absence of fusion of the two 
subdomains to individual members of a specific-binding 
pair (Figure IB) , a further requirement is imposed in 
the present invention in order to increase the resolving 
capacity of the method for studying such interactions 
The further requirement is that either the N-terminal 
subdomain of ubiquitin, or the c-terminal subdomain of 
ubiquitin, or both, must be mutationally altered to 
reduce their ability to produce, through their 
association, a quasi-native ubiquitin moiety. it will 
be recognized by one of skill in the art that the 
binding interaction studies described herein are carried 
out under conditions appropriate for protein/protein 
30 interaction. Such conditions are provided in vivo 
(i.e., under physiological conditions inside living 
cells) or in vitro, when parameters such as temperature 
PH and salt concentration are controlled in a manner 
intended to mimic physiological conditions. 

The mutational alteration of a ubiquitin subdomain 
is preferably a point mutation. In light of the fact 
that it is essential that the reconstituted ubiquitin 
moiety must "look and feel" like native ubiquitin to a 
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ubiquitin-specif ic protease, mutational alterations 
which would be expected to grossly affect the structure 
of the subdomain bearing the mutation are to be avoided, 
A number of ubiquitin-specif ic proteases have been 
5 reported, and the nucleic acid sequences encoding such 
proteases are also known (see e.g., Tobias et al., J. 
Biol. Chem. 266: 12021 (1991); Baker et al., jr. Biol. 
Chem. 267: 23364 (1992)). It should be added that all 
of the at least five ubiquitin-specif ic proteases in the 
10 yeast S. cerevislae require a folded conformation of 

ubiquitin for its recognition as a substrate. Extensive 
deletions within the N- or C-terminal subdomains of 
ubiquitin are an example of the type of mutational 
alteration which would be expected to grossly affect 
subdomain structure and, therefore, are examples of 
types of mutational alterations which should be avoided. 

In light of this consideration, the preferred 
mutational alteration is a mutation in which an amino 
acid substitution is effected. For example, the 
substitution of an amino acid having chemical properties 
similar to the substituted amino acid (e.g., a 
conservative substitution) is preferred. Specifically, 
the desired mild perturbation of ubiquitin subdomain 
interaction is achieved by substituting a chemically 
similar amino acid residue which differs primarily in 
the size of its side chain. Such a steric perturbation 
is expected to introduce a desired (mild) conformational 
destabilization of a ubiquitin subdomain. The goal is 
to reduce the affinity of the N-terminal and C-terminal 
subdomains for one another, not necessarily to eliminate 
this affinity. 

In the Exemplification section which follows, the 
mutational alteration was introduced in the N-terminal 
subdomain of ubiquitin. More specifically, a first 
neutral amino acid residue was replaced with a second 
neutral amino acid having a side chain which differs in 
size from the first neutral amino acid residue side 
chain to achieve the desired decrease in affinity. in 
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the Example, the first neutral amino acid residue was 
isoleucine (either residue 3 or 13 of wild-type 
ubiquitin) . Neutral amino acids which have a side chain 
which differs in size from isoleucine include glycine, 
5 alanine and valine. 

A wide variety of fusion construct combinations can 
be used in the methods of this invention. One strict 
requirement which applies to all N- and C-terminal 
fusion construct combinations is that the C-terminal 
10 subdomain must bear an amino acid (e.g., peptide, 

polypeptide or protein) extension. This requirement is 
based on the fact that the detection of interaction 
between two proteins of interest linked to two 
subdomains of ubiquitin is achieved through cleavage 
after the C-terminal residue of the quasi-native 
ubiquitin moiety, with the formation of a free reporter 
protein (or peptide) that had previously been linked to 
a C-terminal subdomain of ubiquitin. Ubiquitin-specif ic 
proteases cleave a linear ubiquitin fusion between the 
C-terminal residue of ubiquitin and the N-terminal 
residue of the ubiquitin fusion partner, but they do not 
cleave an otherwise identical fusion whose ubiquitin 
moiety is conf ormationally perturbed. In particular, 
they do not recognize as a substrate a C-terminal 
subdomain of ubiquitin linked to a "downstream" reporter 
sequence, unless this C-terminal subdomain associates 
with an N-terminal subdomain of ubiquitin to yield a 
quasi-native ubiquitin moiety. 

Furthermore, the characteristics of the C-terminal 
amino acid extension of the C-terminal ubiquitin 
subdomain must be such that the products of the cleaved 
fusion protein are distinguishable from the uncleaved 
fusion protein. In practice, this is generally 
accomplished by monitoring a physical property or 
3 5 activity of the C-terminal extension which is cleaved 
free from the C-terminal ubiquitin moiety. It is 
generally a property of the free C-terminal extension 
that is monitored as an indication that a quasi-native 
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ubiquitin has formed, because monitoring of the quasi- 
native ubiquitin moiety directly is difficult in 
eukaryotic cells due to the presence of native 
ubiquitin. While unnecessary for the practice of the 
present invention, it would of course be appropriate to 
monitor directly the presence of the quasi-native 
ubiquitin as well, provided that this monitoring could 
be carried out in the absence of interference from 
native ubiquitin (for example, in prokaryotic cells, 
which naturally lack ubiquitin) . 

The size of the C-terminal extension which is 
released following cleavage of the quasi-native 
ubiquitin moiety within a reporter fusion by a 
ubiquitin-specif ic protease is a particularly convenient 
characteristic in light of the fact that it is 
relatively easy to monitor changes in size using, for 
example, electrophoretic methods. For instance, 'if the 
C-terminal reporter extension has a molecular weight of 
about 2 0 kD, the cleavage products will be 
distinguishable from the non-cleaved quasi-native 
ubiquitin moiety by virtue of the appearance of a 
previously absent reporter-specific 20 kD band following 
cleavage of the reporter fusion. 

in light of the fact that the cleavage can take 
Place, for example, in crude cell extracts or in vivo, 
it is generally not possible to monitor such changes in 
molecular weight of cleavage products by simply staining 
an electrophoretogram with a dye that stains proteins 
nonspecifically, because there are too many proteins in 
the mixture to analyze in this manner. one preferred 
method of analysis is immunoblotting. This is a 
conventional analytical method wherein the cleavage 
products are separated electrophoretically , generally in 
a polyacrylamide gel matrix, and subsequently 
transferred to a charged solid support (e.g., 
nitrocellulose or a charged nylon membrane) . ' An 
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antibody which binds to the reporter of the ubiquitin- 
specific protease cleavage products is then employed to 
detect the transferred cleavage products using routine 
methods for detection of the bound antibody. 
5 Another useful method is immunoprecipitation of 

either a reporter-containing fusion to C-terminal 
subdomains of ubiquitin or the free reporter (liberated 
through the cleavage by ubiquitin-specif ic proteases 
upon reconstitution of a quasi-native ubiquitin moiety) 
10 with an antibody to the reporter. The proteins to be 
immunoprecipitated are first labeled in vivo with a 
radioactive amino acid such as 35 S-methionine, using 
methods routine in the art. A cell extract is then 
prepared, and reporter-containing proteins are 
precipitated from the extract using an anti-reporter 
antibody. The immunoprecipitated proteins are 
fractionated by electrophoresis in a polyacrylamide gel, 
followed by detection of radioactive protein species by 
autoradiography or f luorography . 

There are a variety of formats in which these 
analyses can be carried out. The critical limitation is 
that the antibody binding pattern to the uncleaved 
quasi-ubiquitin complex must be distinguishable from the 
pattern for the cleaved product. For example, the 
25 antibody can bind specifically to an epitope of the C- 
terminal reporter moiety, or it may be a polyclonal 
antibody preparation specific for the reporter. It is 
also preferable, for the clearest experimental results 
(although this is not a strict requirement) , that the 
3 0 epitope selected is not one which is native to the 

system (host cell or extract) in which the experiment is 
being carried out. 

Thus, for example, a preferred experimental design 
is to extend the C-terminal subdomain of ubiquitin with 
3 5 a peptide containing an epitope foreign to the system in 
which the assay is being carried out. It is also 
preferable to design the experiment so that the C- 
terminal reporter extension of the C-terrainal subdomain 
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of ubiquitin is sufficiently large, i.e., easily 
detectable by the electrophoretic system employed. In 
this preferred embodiment, the C-terminal reporter 
extension of the C-terminal subdomain should be viewed 
as a molecular weight marker. The characteristics of 
the extension other than its molecular weight and 
immunological reactivity are not of particular 
significance. it will be recognized, therefore, that 
this C-terminal extension can represent an amalgam 
comprising virtually any amino acid sequence combination 
fused to an epitope for which a specifically binding 
antibody is available. This is demonstrated in the 
Exemplification section wherein the C-terminal extension 
of the C-terminal ubiquitin subdomain was a combination 
of the «ha" epitope fused to mouse DHFR (an antibody to 
the "ha" epitope is readily available) . 

Aside from the molecular weight of the C-terminal 
amino acid extension of the C-terminal ubiquitin 
subdomain, other characteristics can also be monitored 
ln ° rdSr t0 detect cleavage of a quasi-native ubiquitin 
moiety. For example, the enzymatic activity of some 
proteins can be abolished by extending their N-termini 
Such a "reporter" enzyme, which, in its native form, 
exhibits an enzymatic activity that is abolished when 
the enzyme is N-terminally extended, can also serve as 
the C-terminal reporter linked to the C-terminal 
ubiquitin subdomain. 

in this detection scheme, when the reporter is 
present as a fusion to the C-terminal ubiquitin 
subdomain, the reporter protein is inactive. However 
if the C-terminal ubiquitin subdomain and the N-terminal 
ubiquitin subdomain associate to reconstitute a quasi- 
native ubiquitin moiety in the presence of a ubiquitin- 
specific protease, the reporter protein will be 
released, with the concomitant restoration of its 
enzymatic activity. This method for monitoring cleavage 
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is particularly useful in an in vitro assay for protein 
interactions. The in vitro assay will be discussed in 
greater detail below. 

The two methods discussed above for monitoring 
cleavage of the quasi-native ubiguitin moiety are meant 
to be examples only. other methods can be devised 
through the use of routine experimentation. 

The quasi-native ubiquitin reconstitution assay for 
determining interaction between members of a specific- 
binding pair can be carried out in a number of formats. 
Common to such formats is the use of two DNA-based 
expression constructs. DNA-based expression constructs 
are genetic elements which can replicate and express the 
desired proteins within the experimental system 
15 employed. The starting material for such expression 
constructs will typically be a well-characterized 
expression vector (e.g., a plasmid) which contains 
regulatory elements (e.g., the origin of replication, 
promoters, etc.) which are suitable for use within a 
20 given experimental system. For example, eukaryotic and 
prokaryotic expression vectors differ in the types of 
regulatory sequences which they contain. Many 
expression vectors suitable for use in a variety of 
experimental systems have been reported and are used 
25 routinely by those skilled in the art. A review of this 
fundamental information will not be undertaken here. 

The methods of this invention can be used to 
determine binding between two predetermined proteins 
(e.g., to determine whether they comprise members of a 
30 specific binding pair). The methods are also applicable 
to the determination of binding between a predetermined 
member of a specific-binding pair and a previously 
unidentified member of the specific binding pair. The 
expression "determine" or "determination", as used in 
this context, is meant to include qualitative as well as 
quantitative assessment of binding interactions. 
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When determining binding between two predetermined 
proteins, each of the two DNA-based expression 
constructs is engineered to contain an expression 
cassette encoding a fusion protein. In a first 
embodiment, the methods of this invention are useful for 
determining whether the two predetermined proteins 
comprise members of a specific binding pair. When 
practicing this embodiment, the first fusion protein 
comprises an N-terminal ubiquitin subdomain fused to a 
first protein (say, Pi) to be tested for its ability to 
interact with {i.e., to bind to) a second protein (say, 
P2). The second fusion protein comprises a C-terminal 
ubiquitin subdomain fused to P2. In one embodiment of 
this invention, a C-terminal domain of ubiquitin can be 
fused to P2 at its N-terminus and to a reporter at its 
C-terminus. As discussed above, either the N-terminal 
ubiquitin-subdomain or the C-terminal ubiquitin 
subdomain, or both, are mutationally altered to reduce 
their ability to associate to form a quasi-native 
20 ubiquitin moiety. When the encoded fusion proteins are 
contacted with one another, the interactions between Pi 
and P2 will greatly increase the local concentration of 
the two ubiquitin subdomains, thereby promoting 
association and the reconstitution of a quasi-native 
25 ubiquitin moiety, which is then cleaved at the junction 
between the reporter and C-terminal ubiquitin domains by 
a ubiquitin-specif ic protease. 

The summary description in the preceding paragraph 
does not discuss certain important experimental 
30 considerations. For example, in light of its role as an 
affinity component, it will be recognized that Pi can be 
fused to the N-terminus or the C-terminus of the N- 
terminal ubiquitin subdomain. Similarly, P2 can be 
fused to the N-terminus or the C-terminus of the C- 
3 5 terminal ubiquitin subdomain. If P2 is fused to the C- 
terminus of the C-terminal ubiquitin subdomain, it will 
be removed by cleavage by the ubiquitin-specif ic 
protease, providing that the ubiquitin subdomains 
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associate to form a quasi-native ubiquitin moiety. 
Consistent with the summary description in the preceding 
paragraph, if the P2 moiety is fused to the C-terminus 
of the C-terminal ubiquitin subdomain, it may also be 
used as a reporter for detecting reconstitution of a 
quasi-native ubiquitin moiety. Furthermore, the 
position of P 2 within the C-terminal reporter-containing 
region of the fusion is not a critical consideration. 

The two DNA-based expression constructs can be co- 
expressed in a host cell known to contain ubiquitin- 
specific proteases (all eukaryotic cells are known to 
contain such proteases, whose specificity of cleavage is 
either identical or substantially similar among 
different species) . When maintained under conditions 
appropriate for metabolic activity, the fusion proteins 
encoded by both DNA-based expression constructs will be 
expressed. The cells are subsequently lysed and 
analyzed for the release of the C-terminal reporter 
extension of the C-terminal ubiquitin subdomain. This 
release will occur at significant levels only if pi and 
P2 interact under the conditions of the assay. 

This co-expression format is carried out, for 
example, in a eukaryotic cell culture (e.g., yeast) 
which contains a ubiquitin specific protease. 
Alternatively, the DNA-based expression vectors can be 
co-expressed in prokaryotic cells (e.g., E. coli) which 
have been transformed with an expressible gene encoding 
a ubiquitin-specific protease. Genes encoding 
ubiquitin-specific proteases have been reported 
previously, and the introduction of such genes in 
expressible form into a prokaryotic cell is routine. 

An alternative to coexpression of the two DNA-based 
expression vectors is the expression of the two vectors 
in different host cell cultures. Extracts from the two 
cell cultures (or purified, or partially purified 
preparations of the two fusion proteins) are then 
combined in presence of a ubiquitin-specific protease. 
This method permits in vitro analysis of protein 
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interactions. For example, the fusion proteins can be 
isolated individually, and one can be attached to a 
solid support (e.g., to a well in a multiwell plate). 
The second fusion protein is then contacted with the 
5 plate-affixed fusion protein in the presence of a 

ubiquitin-specif ic protease and, if Pi and P2 bind to 
one another, the C-terminal reporter extension will be 
cleaved from the C-terminal ubiquitin subdomain. 
Whether cleavage has occurred can be determined, for 
10 example, by the methods described above. A type of 
reporter described above, which is enzymatically 
inactive until its N-terminal extension is cleaved off, 
can also be used in the in vitro version of this method. 
In a second embodiment in which the binding 
15 interaction of two predetermined proteins are studied, 
it is known in advance that both proteins comprise 
members of a specific binding pair. This embodiment is 
useful, for example, as an alternative to the enzyme- 
linked immunoadsorbent assay (ELISA) , wherein the two 
members of a specific-binding pair are used to determine 
the presence or concentration of one of the members of 
the specific-binding pair (or a homolog thereof) in a 
sample. This aspect of the invention is shown 
diagrammatically in Figure 4. 
25 The terms "ligand" (L in Fig. 4) and "affinity 

reagent" (AR in Fig. 4) are used in this context to 
describe members of the specific-binding pair. In the 
practice of this embodiment of the invention, a sample 
is provided which is to be tested for the presence of 
30 one member of a specific-binding pair, the ligand. In a 
preferred embodiment, the affinity "reagent is a protein 
(preferably, an antibody) . Two fusion constructs are 
employed to determine the presence of the ligand in the 
sample to be analyzed. A first fusion construct 
35 comprises an N-terminal subdomain of ubiquitin linked to 
an affinity reagent which is known to bind to the 
ligand. The second fusion construct comprises a C- 
terminal subdomain of ubiquitin linked at its N-terminus 
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to the ligand (or a binding homolog thereof) and at its 
C-terminus to a reporter which is inactive when linked 
to the C-terminal subdomain of ubiguitin. As with all 
embodiments of the present invention, either the N- 
terminal subdomain or the C-terminal subdomain, or both, 
are mutationally altered to reduce the ability of the N- 
and C-terminal subdomains to associate to reconstitute a 
guasi-native ubiguitin moiety when contacted under 
conditions appropriate for protein/protein interaction. 
It will be recognized that in an alternative of this 
format, the fusion constructs can be designed so that 
the first fusion construct comprises an N-terminal 
subdomain of ubiguitin linked to a ligand and the second 
fusion construct comprises a C-terminal subdomain of 
ubiguitin linked at its N-terminus to an affinity 
reagent. The C-terminus of the second fusion construct 
must always be linked to a reporter which provides 
signal only following cleavage by a ubiguitin-specif ic 
protease. 

Prior to analyzing the sample in which the presence 
of ligand is to be determined, it is first preferable to 
generate a standard curve correlating predetermined 
levels of ligand in a sample with detected levels of 
ubigui tin-specific protease activity. To determine 
background levels of activity in the absence of free 
ligand, predetermined guantities of the first and second 
fusion constructs are incubated under conditions 
appropriate for binding of the analyte of interest to 
the affinity reagent. This incubation mixture is then 
contacted with a ubiguitin-specif ic protease under 
conditions appropriate for protease activity. A 
background level of protease activity is determined 
which is based on the formation of a guasi-native 
ubiguitin moiety resulting primarily from the binding 
affinity of the ligand and the affinity reagent. 

Other points of the standard curve are generated by 
forming an incubation mixture of the type described in 
the preceding paragraph with the addition of 
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predetermined quantities of unbound ligand which acts as 
a competitor with the ligand component of the first 
fusion construct thereby effecting a decrease in the 
reconstitution of the quasi-native ubiquitin moiety and 
5 a concomitant decrease in the reporter activity. 

Following the construction of the standard curve 
representing reporter activity as a function of ligand 
concentration, an otherwise identical incubation is 
established using the sample to be tested for the 
10 presence of ligand. Following an incubation period 

appropriate for binding of ligand to affinity reagent, 
the incubation mixture in contacted with a ubiquitin- 
specific protease under conditions appropriate for 
protease activity. The level of reporter activity is 
determined and, by reference to the standard curve, the 
level of reporter activity is correlated with the ligand 
concentration in the sample being analyzed. 

In a third embodiment in which the binding 
interaction of two predetermined proteins are studied, 
the methods of this invention can be used to identify an 
inhibitor of the binding of a ligand to an affinity 
reagent. Fusion constructs of the type described 
previously are employed. More specifically, an li- 
ter mi nal subdomain of ubiquitin fused to either ligand 
25 or affinity reagent is provided. A C-terminal subdomain 
of ubiquitin fused at its N-terminus to the member of 
the specific-binding pair which is not fused to the N- 
terminal subdomain is also provided. The C-terminus of 
the C-terminal ubiquitin subdomain is fused to a 
30 reporter. Mutational alteration (s) which reduce the 

affinity of the ubiquitin subdomains are also introduced 
for the reasons discussed previously. 

Background levels of reporter activity in the 
absence of a sample to be tested for the ability to 
interfere with ligand/af f inity reagent binding are 
determined by incubations (preferably sequential) 
appropriate for l) protein/protein interaction; and 2) 
ubiquit in-specific protease activity. 
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The steps described above are then repeated with 
the addition of a compound or a mixture of compounds to 
be tested for the ability to interfere with the binding 
of the ligand to the affinity reagent, A decrease in 
5 the reporter activity resulting from the inclusion of 

the compound to be tested for the interfering ability is 
indicative of the presence of a compound (or compounds) 
having the ability to interfere with the binding 
interaction of the ligand and the affinity reagent. 
10 Another method of the present invention which is 

useful for determining the presence of a ligand in a 
sample employs a pair of affinity reagents (AR 1 and AR 2 ) , 
each of which binds the ligand. This aspect is shown 
diagrammatically in Figure 5. The affinity reagents 
15 have a binding specificity such that they can bind 
simultaneously to the ligand. Preferably the two 
affinity reagents are antibodies which specifically bind 
to distinct epitopes of the ligand. 

In practicing this method, two fusion constructs 
2 0 are provided. The first fusion construct comprises an 
N-terminal subdomain of ubiquitin linked to a first 
affinity reagent which specifically binds to a first 
epitope of the ligand. The second fusion construct 
comprises a C-terminal subdomain of ubiquitin linked at 
its N-terminus to a second affinity reagent which binds 
specifically to a second epitope of the ligand and 
linked at its C-terminus via an amide bond to a 
reporter. A mutational alteration (or alterations) is 
introduced in the ubiquitin subdomains in order to 
reduce the affinity of the N-terminal subdomain for the 
C-terminal subdomain. 

One point on a standard curve is generated by 
incubating predetermined quantities of the first and 
second subdomains under conditions appropriate for 
3 5 protein/protein interaction. Following this incubation, 
the components are then contacted with a ubiquitin- 
specific protease under conditions appropriate for 
protease activity. Cleavage of the quasi-native 
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ubiguitin complex present in the incubation mixture is 
determined by assaying for reporter activity. The level 
of reporter activity determined in this step represents 
background reporter activity in the absence of ligand. 

Additional points on the standard curve are 
generated by preparing otherwise identical incubation 
mixtures to which known quantities of ligand are added. 
Reporter activity is determined following appropriate 
incubations, and the data obtained using varying 
concentrations of added ligand is used to complete the 
standard curve . 

To analyze a sample for the presence of ligand, a 
mixture of the first and second fusion is then prepared, 
the mixture being identical to that described above but 
for the addition of a sample to be tested for the 
presence of ligand. Following appropriate incubations, 
reporter activity is determined. Reference to the 
standard curve correlates reporter activity with ligand 
concentration in the sample. 

In another aspect of the invention, an efficient 
method for the determination of binding between a 
predetermined member of a specific-binding pair and a 
previously unidentified member of the specific-binding 
pair is the library screening method of the present 
25 invention. This method is useful, for example, in 
screening for proteins which bind to a protein of 
interest. Like the method described above for studying 
interactions between two predetermined proteins, the 
library screening method can be practiced in vivo or in 
3 0 vitro. The library screening method requires the 
construction of a DNA library in one of the two 
expression vectors. The source of DNA for the DNA 
library can be, for example, cDNA, or genomic DNA from 
any organism of interest that has been fragmented to 
generate DNA fragments of a length appropriate for 
insertion into a DNA-based expression vector employed. 

More specifically, a first protein (Pi) is selected 
for use in a screening procedure designed to identify a 
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second protein (P2) which interacts with PI. A first 
DNA-based expression vector is provided which contains 
an expression cassette encoding a C-terminal subdomain 
of ubiquitin as described above, fused in frame to DNA 
encoding Pi (and to a reporter moiety if Pi does not 
double as a reporter) . In the fusion protein encoded by 
the expression cassette, Pi can be fused to either the 
N-terminus or the C-terminus of the C-terminal ubiquitin 
subdomain, while a reporter should be fused to the C- 
terminus of the C-terminal subdomain of ubiquitin. 

A second DNA-based expression vector containing an 
expression cassette is also provided. The second DNA- 
based expression cassette contains randomly generated 
DNA fragments (e.g., genomic or cDNA fragments) from an 
organism of interest fused to DNA encoding the N- 
terminal subdomain of ubiquitin. In the fusion protein 
encoded by the expression cassette, any protein (P2) 
encoded by the randomly generated DNA fragments from the 
organism of interest can be fused to either the C- 
terminus or the N-terminus of the N-terminal ubiquitin 
subdomain. All of the considerations discussed above in 
connection to the testing of two specific fusion 
constructs are relevant to all applications of a screen- 
based version of this method. 

It is necessary that the DNA encoding at least one 
of the two ubiquitin subdomains be mutationally altered 
to reduce the ability of the encoded N- and C-terminal 
subdomains of ubiquitin to associate to reconstitute a 
quasi-native ubiquitin moiety when co-expressed in a 
cell. The first and second expression vectors are then 
used to co-transform a suitable host cell. 

In the event that the randomly generated DNA from 
the organism of interest encodes a protein, it will be 
expressed as a fusion with the N-terminal ubiquitin 
subdomain. For purposes of discussion, it will be 
assumed that the open reading frame encodes a portion of 
a protein and that, in the fusion protein, it is fused 
to the N-terminus of the N-terminal ubiquitin subdomain. 
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In the event that proteins Pi and P2 interact with 
one another, the effective (local) concentration of the 
N- and C-terminal ubiquitin subdomains will be greatly 
increased, because they will be brought into mutual 
proximity through interactions between Pi and P2 . This 
local increase in the concentration of the ubiquitin 
subdomains promotes the association of the ubiquitin 
subdomains to reconstitute a quasi-native ubiquitin 
moiety- This quasi-native ubiquitin moiety is 
recognized and cleaved by ubiquitin-specif ic proteases 
in a cell. The final step in the method is to identify 
those cells in which the fusion protein encoded by the 
second DNA-based expression vector is cleaved by 
ubiquitin-specif ic proteases. This cleavage is 
indicative of interaction between PI and P2 and could be 
used to locate cells that express a vector encoding a 
library-derived, P2-containing fusion. Cleavage of the 
fusion is identified by any of the assays discussed 
above . 

This arrangement can also be reversed. That is, 
the randomly generated DNA fragment can be fused to the 
C-terminal ubiquitin subdomain rather than the N- 
terminal subdomain, and the DNA encoding Pi can be fused 
to the C-terminal subdomain. 

In addition to the in vivo method for screening a 
DNA expression library, an in vitro method is also 
disclosed herein. In the practice of the in vitro 
library screening method, two DNA-based expression 
vectors are also employed. in order to detect 
interactions between proteins of interest, constructs 
similar to those discussed in connection with the in 
vivo assay are employed. For example, DNA encoding Pi 
can be fused to DNA encoding the N-terminal ubiquitin 
subdomain. In the encoded fusion protein, Pi can be 
fused to either the N-terminus or the C-terminus of the 
N-terminal ubiquitin subdomain. 
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The design of the second DNA-based expression 
vector has limitations which do not apply to the 
counterpart vector described in connection with the in 
vivo library screening method. For example, in the in 
5 vitro method, the fusion protein encoded by the second 
DNA-based expression vector is arranged in such a way 
that P2 is fused to the N-terminus of the C-terminal 
ubiguitin subdomain. At its C-terminus, the C-terminal 
subdomain of ubiguitin is fused to a reporter protein 
10 which is inactive when N-terminally extended. As in 

previous applications described herein, at least one of 
the ubiguitin subdomains is mutationally altered to 
reduce the ability of the N- and C-terminal ubiguitin 
subdomains to associate to reconstitute a guasi-native 
15 ubiguitin moiety when contacted under conditions 
appropriate for protein/protein interactions. 

The fusion protein encoded by the first and second 
expression vectors are then expressed individually in a 
suitable host and subseguently purified. It will be 
2 0 recognized, of course, that the cells containing the 
second DNA-based expression vector must be grown and 
tested to ensure that a clonal population is being 
studied. One of the two purified fusion proteins is 
affixed to a solid support such as wells of a multiwell 
25 plate using standard technigues. The other fusion 
protein is then dissolved in a buffered solution 
appropriate for protein/protein interactions, and the 
resulting sample is brought into contact with the 
protein which is affixed to the solid support. in 
30 practice this is accomplished, for example, by placing 
the suspension in wells in the multiwell plate to which 
the other fusion protein has already been attached. 
When incubated under conditions appropriate for 
protein/protein interactions, the effective local 
3 5 concentration of the N- and C-terminal ubiguitin 

subdomains will be increased for the reasons discussed 
previously, providing that Pi and P2 interact with one 
another. The resulting guasi-native ubiguitin moiety 
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will be recognized by a ubiguitin-specif ic protease 
which is added to the incubation mixture and will cleave 
the complex at the C-terminus of the C-terminal 
ubiguitin subdomain. This cleavage will activate the 
reporter protein which is inactive when N-terminally 
extended. An assay for this newly generated activity is 
then carried out to determine whether cleavage occurred. 
Detection of cleavage by this assay is indicative of the 
interaction between PI and P2. 

When both Pi and P2 are specific, predetermined 
proteins, this assay can be used as a novel method for 
analyzing protein/protein interactions in vitro. When 
PI (or P2) is a collection of different proteins 
(analogous to a collection of different DNA fragments 
15 encoding different proteins in the in vivo version of 
the screen described above) , the in vitro assay is 
working as a screen for protein/protein interactions 
with individual members of the collection of proteins 
added to specific wells of a multiwell plate to which a 
20 fusion containing P2 , the C-terminal subdomain of 

ubiguitin, and a reporter is affixed. Alternatively, 
individual members of the collection of proteins are 
affixed to individual wells of a multiwell plate, and a 
fusion containing P2 and the C-terminal subdomain of 
25 ubiquitin is added to the wells in a buffered solution. 
The multiwell plate-based variants of this in vitro 
screen are but a few examples of the variants that can 
readily be devised through the application of only 
routine experimentation. 

30 Exemplification 

In the constructs of this work (Fig. 3), ubiguitin 
(Ub) was joined to the N-terminus (amino terminus) of 
the 21-kD mouse dihydrof olate reductase (DHFR). The 
C-terminus (carboxyl terminus) of DHFR was extended with 

35 the "ha" epitope tag, yielding a 22-kD dha (DHFR— ha) 
reporter. The final constructs resided in the CEN6 , 
^Pi-based vector pRS314 or in the CEN6 , Ui^3-based 
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PRS316 (Sikorski and Hieter, Genetics 122, 19 (1989) , 
and were expressed from the P^ promoter (inserted 
between the BamHI and EcoRI sites of the pRS314/16 
polylinker) . 

More specifically, the S. cerevisiae Ub gene 
engineered to contain a Sail site immediately upstream 
of its start codon was amplified, using PGR (see e.g 
Ausubel et al., current Protocols in Molecular Biology 
(Wiley, New York, 1992)), either from the Sail site to 
the first c of codon 37 or from codon 35 to codon 76. 
The primers were constructed in a way that yielded 
after ligation of the two amplified fragments, a Ub ORF 
which contained the GGG codon for Gly at position 35 
(instead of the synonymous wild-type GGT codon) , 
resulting in a BamHI site between codons 35 and' 3 7 
This ORF was ligated to a fragment encoding dha 
(DHFR-ha) as described by Johnson et al. (EMBO J. 11 
497 (1992)), yielding an ORF encoding Ub-dha (Fig. 3-1) 
which contained the sequence Met-Arg-Ser-Gly-ile-Met ' 
(SEQ ID NO. 4) between Gly™ of Ub and Val' of DHFR. 

Fragments encoding ub mutants (Fig. 3-11 m IV ) 
were produced by replacing the Sall-BstXI fragment 'in 
construct I with double-stranded (ds) oligonucleotides 
containing a 5' Sail site, a 3' end annealable to the 
BstXI overhang, and an altered codon at position 13 of 
Ub. Construct V was produced by replacing residues He 
3 and He 13 of ubiquitin in a Ub— DHFR fusion with 
Glycine residues. A fragment encoding the i68 insertion 
(Fig. 3-VI) was produced using PGR, S . cerevisiae 
genomic. DNA, and primers designed to amplify the region 
of STE6 (see e.g., McGrath and Varshavsky, Nature 340 
400 (1989); Kuchler et al., EMBO J. 3 973 (1989)) f rom 
codon 196 to codon 262. The primers contained 5' BamHI 
sites, so that the amplified, BamHI-cut fragment of STE6 
could be inserted into the BamHI site between the Ub 
codons 35 and 37 in constructs l-iv, yielding constructs 
VI-IX. To avoid a frameshift in the resulting ORFs, the 
sequence AA was inserted before the STE6 codon 196 
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resulting in a Glu residue between residue 3 6 of Ub and 
residue 196 of Ste6-the first STSff-derived residue of 
i68. Construct X (Fig. 3) was produced by replacing the 
Sall-Xbal fragment in construct VI with a ds 
5 oligonucleotide that supplied a start codon followed by 
two Gly codons. In the C^-dha fusion encoded by the 
resulting fragment (Fig. 3-X) , residue 35 of Ub was 
preceded by a 32-residue linker all of whose residues 
except the N-terminal Met-Gly-Gly were specified by 
10 codons 234 to 2 62 of STE6 . 

The portion of construct XV contained the 

above Ste6-derived sequence preceding the C ub moiety, the 
leucine zipper region of S. cerevisiae GCn4 (see e.g., 
Vinson et al., Science 246, 911 (1989) ; Hinnebusch, 
15 Proc. Natl. Acad. Sci . USA 81, 6442 (1984); O'Shea et 
al., Science 254, 539 (1991); Ellenberger et al. , Cell 
71, 1223 (1992); Pu and Struhl, Nucl . Acids Res. 21, 
4348 (1993)) (residues 235-281, denoted as , the 
construction-generated N-terminal Met, and the sequence 
20 Gly-Glu-Ile-Ser-Thr (Fig. 3-XV) (SEQ ID NO. 5). A 
fragment encoding z 1 was produced using PCR, s. 
cerevisiae genomic DNA, and primers that yielded the 
amplified product bearing single 5' BamHI and Sail sites 
and single 3' Bglll and Xbal sites. Construct XV was 
25 produced by replacing the Sall-Xbal fragment of 

construct VI with the PCR-amplif ied, Sall/Xbal-cut 
fragment. To produce constructs XI-XIV, the same 
fragment was cut with BamHI and Xbal, and was used to 
replace BamHI-Xbal fragments in derivatives of 
0 constructs VI-IX, so that the resulting fragments 

contained two consecutive stop codons in frame with the 
z^coding sequence (codons 235-281 of Gcn4); they also 
encoded Gly-Glu-Ile-Ser-Thr-Leu-Glu (SEQ ID NO. 6) 
C-terminally to z,, and Gly-Gly-Ser-Thr-Met (SEQ ID NO. 
5 3) between z, and N ub (Fig. 3). The z, motif in N^-z, and 
its derivatives but not in z^-dha bore a Met 250 -Thr 25 ° 
replacement (residue numbers of Gcn4) , which occurred 
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during construction; this replacement would be expected 
to weaken the interaction between z 1 domains. Final DNA 
constructs were verified by sequencing. 

Experiments were carried out to determine whether 
ubiquitin-specif ic proteases (UBPs) can cleave a fusion 
whose Ub moiety bears an insertion within a loop 
(residues 34-40) connecting the only a-helix of Ub to a 
/3-strand (Fig. 2). The test fusion, Ub i68 -dha, contained 
the dha reporter and a 68-residue insertion (denoted as 
i68) after residue 36 of Ub (Fig. 3-VI) • The sequence 
of the insertion was derived from the cytosolic region 
of the yeast Ste6 protein between its transmembrane 
segments 4 and 5 (McGrath and Varshavsky, Nature 340, 
400 (1989); Kuchler et al., EMBO J. 8, 3973 (1989)). 
This sequence was chosen because it was expected to be 
either flexible or folded in a way that positions its 
ends in proximity to each other. 

S. cerevisiae expressing Ub-dha or Ub i68 -dha (Fig. 
3-1, VI) were labeled with 35 S-methionine for 2 or 5 min 
at 3 0°C. Whole cell extracts (prepared in the presence 
of N-ethylmaleimide to inhibit UBPs) were incubated with 
anti-ha antibody, and immunoprecipitated proteins were 
analyzed by SDS-PAGE. All experiments used the YPH500 
strain of S. cerevlsiae (MAT a ura3 lys2 trpl ade2 his3 
Ieu2) grown at 30°C to A 6QQ of -1 in a synthetic (SD) 
medium containing 0.1 mM CuS0 4 . Transformation of S. 
cerevisiae, labeling with 35 S-methionine, cold chase, 
preparation of whole cell extracts in the presence of 
N-ethylmaleimide, immunoprecipitation with a monoclonal 
anti-ha antibody, SDS-PAGE (12%) and fluorography were 
carried out as described by Johnson et al . (EMBO J. 11, 
497 (1992)), except that "zero" time samples were 
withdrawn and processed 1 min (at 30 °C) after the 
addition of a chase medium containing cycloheximide and 
unlabeled methionine. The labeling time was either 2 or 
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5 rain, as indicated. Immunoblot analysis with anti-ha 
antibody was performed as described, except that the ECL 
detection system (Amersham) was used. 

The in vivo cleavage of Ub-dha and Ub i68 -dha was 
complete by the end of a 2-min pulse. Thus, a large 
insertion within the 34-40 region of Ub (Figs. 1A and 2) 
does not interfere with the recognition of Ub by UBPs, 
suggesting that at least within the limits of temporal 
resolution of the 2-min pulse-cleavage assay, this 
insertion does not perturb the folding of Ub. 

A fusion whose Ub moiety bears replacements of lie 3 
and lie 13 with Gly residues is cleaved in vivo much more 
slowly than an otherwise identical fusion bearing 
wild-type Ub. Since lie 3 and He 13 are buried in the 
15 hydrophobic core of Ub (Vijay-Kumar et al M j m Mol . 

Biol. 194, 531 (1987)), their conversion to glycines is 
expected to decrease the conformational stability of Ub 
without necessarily changing its overall folding 
pattern. To make similar but less destabilizing 
20 alterations in the sequence of Ub, only lie 13 was 
replaced with either Val, Ala or Gly hydrophobic 
residues of decreasing size. The resulting Ub fusions, 
Ub v13 -dha, Ub A13 -dha, and Ub G13 -dha (Fig. 3-II, Hi, iv) , 
were completely cleaved by the end of a 5-min pulse, as 
25 was also Ub-dha , bearing wild-type Ub. 

A combination of the i68 insertion (Fig. 2) and a 
substitution at position 13 of Ub was then tested to 
determine whether such a combination results in a less 
efficient cleavage of a fusion by UBPs. By the end of a 
30 2-min pulse, no uncleaved Ub i68 -dha, and at most traces 
of Ub vl3 « f68 -dha could be detected. However, the cleavage 
of Ub*"' ,68 -dha, and especially of Ub G13 -'" 68 -dha was much 
slower, in that significant amounts of the uncleaved 
fusions were observed by the end of a 2-min pulse (Figs. 
35 3-VIII, IX and 4B, lanes c and d) . 

The i68 insertion places the two "halves" of a 
nascent Ub farther apart (Figs. 1A and 2), and therefore 
is expected to retard the folding of Ub; this effect 
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was not detected apparently because of insufficient 
temporal resolution of the 2-min pulse-cleavage assay. 
However, the assay did detect a decrease in the rate of 
Ub folding with fusions whose Ub moieties bear both the 
5 insertion and another alteration such as Ile 13 -Gly", 
Which by itself is also insufficient to cause a 
detectable retardation of ub folding. These results can 
be interpreted within the diffusion-collision model of 
protein folding (Karplus and Weaver, Nature 260, 404 
10 (1976) ; Kim and Baldwin, Annu. Rev. Biochem. 51,' 459 
(1982); Jaenicke, Biochemistry 30, 3147 (1991)), i n 
which marginally stable units of isolated secondary 
structure form early and then coalesce into the native 
conformation, with the overall rate of folding dependent 
15 on both the stability of folded subdomains and on the 

rates of their collision and coalescence. In this view 
the relevant subdomains of Ub are its N-terminal and 
C-terminal regions (residues l to -36 and -37 to 76 
respectively). indeed, in the native Ub, its first' 34 
0 residues are folded into an a-helix interacting with a 
double-stranded antiparallel 0-sheet (Fig. 2) 
(Vijay-Kumar et al., J. M al . Biol. 194, 531 (1987)). 
Thus, the i68 insertion retards Ub folding primarily 
through a reduction in the frequency of collisions 
5 between the N-terminal and C-terminal subdomains of Ub 
whereas the effect of substitutions at position 13 is a 
decreased conformational stability of the N-terminal 
subdomain, caused by disruption of contacts between He 1 * 
in the second strand of the /3-sheet and the hydrophobic 
) face of the a-helix (Fig. 2). 

The finding that N^ and c b fragments of ubiquitin 
are its conformational subdomains is also supported by a 
spectroscopic study of the chemically synthesized halves 
of ubiquitin, in which it was found that fragments 
closely related to the N ub and as disclosed herein are 
largely unfolded in an aqueous buffer but display 
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spectroscopic properties indicative of partial folding 
in the presence of 2 0% methanol (a secondary structure- 
stabilizing solvent) (see Cox et al. , J. Mol . Biol. 234: 
483 (1993)). 

The relative insensitivity of Ub folding to a large 
insertion within the 34-40 loop (Figs. 1A and 2) 
suggested that separate, coexpressed N-terminal and 
C-terminal fragments of Ub might also be able to 
reconstitute the folded Ub conformation detectable by 
the UBP cleavage assay. m a test of this conjecture, a 
C-terminal fragment of wild-type Ub (residues 35-76, 
denoted as C^) was expressed as a fusion to the dha 
reporter (C ufa -dha) , while an N-terminal fragment of 
wild-type Ub (residues 1-37, denoted as ) was 
expressed as a fusion to the "leucine zipper" 
homodimerization domain of the yeast Gcn4 protein 
(Nub-z,, with z, denoting the zipper of Gcn4) (Fig. 3-x, 
XI) . 

S. cer&vislae expressing C^-dha either alone or 
together with N^-z, were labeled with 35 S-methionine for 
5 min at 30°c, followed by a chase for 0, 10 and 30 min, 
and SDS-PAGE analysis of labeled proteins 
immunoprecipitated with anti-ha antibody. When 
expressed by itself, C^-dha remained uncleaved at the 
junction between C ub and dha; instead, the entire fusion 
was slowly degraded. However, coexpression of C ufa -dha 
and N^-z, resulted in the cleavage of C ufa -dha, yielding 
the long-lived dha reporter which accumulated during a 
3 0-min chase. This cleavage was slow in comparison to 
the cleavage of otherwise identical fusions containing 
either the wild-type or the insertion-bearing Ub moiety; 
the latter cleavages were complete by the end of a 2 -min 
pulse. 

It can be concluded from this data that the 
cleavage of C ub -dha requires the presence of N^-z,, and 
is the consequence of an in vivo association between the 
C ub and moieties of these fusions. This association 

results in at least transient formation of a Ub moiety 



WO 95/29195 



PC1YUS95/04536 



-33- 

which is similar enough to native Ub to be a substrate 
of UBPs. The relatively low overall rate of the C^-dha 
cleavage apparently results from a low overall rate of 
"productive" in vivo collisions between the and 
5 fragments, in comparison to the rate of analogous 

collisions between the same subdomains of Ub when they 
are linked within a single polypeptide (Fig. 1A, B) . In 
other words, the effective (local) concentration of the 
two Ub subdomains is much higher for linked than for 
10 unlinked subdomains. 

The efficiency of reconstitution of native Ub from 
its coexpressed subdomains depended on conformational 
stability of the N-terminal subdomain: coexpression of 
C^-dha (Fig. 3-X) with either N^-z, or N^-z, (Fig. 
15 3-XIII, XIV), which bore Gly or Ala instead of wild-type 
lie at position 13, resulted in virtually no cleavage of 
C -dha, in contrast to the results with either n£-z, or 
nS 3 -z, (Fig. 3-XI, XII), which bore either He, the 
wild-type residue, or Val, a hydrophobic residue larger 
than Ala and Gly, at position 13 (Fig. 1C) . 

To analyze a conf ormationally destabilized Ub, it 
was determined whether the folding of a Ub moiety 
containing an altered subdomain could be "rescued" in 
trans by the wild-type version of the same subdomain. 
25 Ub^-dha (Fig. 3-V) , whose Ub moiety bears 

conformational^ destabilizing replacements of He 3 and 
He 13 with Gly residues, is cleaved in vivo very slowly: 
whereas the cleavage of Ub-dha was complete by the end 
of a 5-min or even a 2-min pulse, only -io% of Ub^-^-dha 
3 0 was cleaved by the end of a 5-min pulse, and -33% was 

cleaved after a 10-min chase. However, coexpression of 
Ub G3 < 13 -dha and N^-z, (Fig. 3-XI) increased the yield of 
cleaved Ub G3 ' 13 -dha (after 10 min of chase) from -33% to 
-66%. Significantly, this rate of cleavage was still 
35 lower than that observed with coexpressed N^-z, and 

c ub -dha ' when u b reconstitution could occur exclusively 
in trans (Fig. 1A) . 



20 
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Thus, although Ub G3 ' 13 is able to adopt a 
quasi-native (recognizable by UBPs) Ub conformation, the 
altered N-terminal subdomain of Ub 03 - 13 also Impedes the 
association between the subdomain of Ub 53 ' 13 and the 
5 coexpressed, trazis-acting subdomain. A model that 

accounts for these findings posits two ensembles of the 
conformational^ unstable Ub G3 < 13 moieties: (i) a set of 
"open" or "unfolded" Ub* 3 ' 13 conformations which allow the 
"invasion" by the coexpressed fragment and its 

10 coalescence with the C ub region of Ub G3 - 13 -dha, resulting 
in reconstitution of Ub moiety and in the fusion's 
cleavage; (ii) a set of "closed" or "folded" ub 03 - 13 
conformations which preclude the invasion by n£ . These 
conformations, in addition to interconverting with open 
15 conformations, include quasi-native Ub species that can 
be recognized by UBPs. 

That fragments of a protein can reassociate to form 
a functional, quasi-native species has been demonstrated 
for a variety of proteins other than Ub; the examples 
20 include ribonuclease A, staphylococcal nuclease and 

other proteins in vitro {Anfinsen et al. , Enzymes 4, 177 
(1971); Hantgan and Taniuchi , J . Biol. Chem. 252, 1367 
(1977); Holmgren and Slaby, Biochemistry 18, 5591 
(1979); Galakatos and Walsh, Biochemistry 26, 8475 
25 (1987); Girons et al., J. Biol. Chem. 262, 622 (1987); 

Johnsson and Weber, Eur. J. Biochem. 188, 1 (1990)), and 
also S. cerevisiae isoleucyl-tRNA synthetase, E. coli 
Lac permease and Tet protein in vivo (Landro and 
Schimmel, Curr . Op. Str . Biol. 3, 549 (1993); Shiba and 
30 Schimmel, Proc . Natl. Acad. Sci . USA 89, 1880 (1992); 

Rubin and Levy, J. Bact. 173, 4503 (1991) ; Wrubel et 
al., J. Bact. 172, 5374 (1990); Bibi and Kaback, Proc. 
Natl. Acad. Sci. USA 87, 4325 (1990)). A previously 
unexplored aspect of protein reconstitution was the 
35 finding by the present inventors that alterations in 
conformational stability of Ub fragments strongly 
influence the efficiency of Ub reassembly. This result, 
together with the discovery that the UBP-mediated 
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cleavage of a Ub fusion requires folded Ub, led the 
inventors to a new assay for in vivo protein 
interactions, . as shown below. 

Experiments were designed to determine whether 
linking of two polypeptides that interact in vivo to N 

ub 

and c ub c °uld facilitate reconstitution of Ub by bringing 
the Ub fragments close together. The C^-dha fusion was 
linked to a region of s. cerevlslae Gcn4 (residues 
235-281) that contained the extensively characterized 
leucine zipper homodimerization domain (denoted as z,) 
but lacked an essential part of the Gcn4 DNA-binding 
domain (Vinson et al. , Science 246, 911 (1989); 
Hinnebusch, Proc . Natl. Acad. Scl . USA 81, 6442 (1984); 
O'Shea et al., Science 254, 539 (1991); Ellenberger et 
15 al., Cell 71, 1223 (1992); Pu and Struhl , Nucl . Acids 
Res. 21, 4348 (1993)). in the resulting z^-dha, a 
3 2 -residue linker, derived from the yeast Ste6 sequence, 
was inserted between the z 1 zipper and C ub (Fig. 3 -XV) to 
ensure that N ub and C ub subdomains could be spatially 
20 proximal within a z^mediated complex between z,C u -dha 

1 ub 

and N ub- Z T 

When expressed by itself, z^-dha remained 
uncleaved at the C^-dha junction, and was slowly 
degraded during the chase, similarly to the results with 
25 c U b" dha / which lacked z 1 . However, coexpression of 

z i c ub~ dha and N^ 3 -z lf bearing a destabilizing Ile-^Gly 
replacement at position 13 of Ub, resulted in a 
significant cleavage of z^-dha (yielding dha) in the 
course of a 30-min chase (Figs. 3-XIV and XV). in 
3 0 contrast, no such cleavage was observed when N^ 3 -z 1 was 
coexpressed with C^-dha, which lacked the z, 
dimerization domain. 

Similar results (but with a faster cleavage of 
z i c ub~ dha ) were obtained upon coexpression of z,C u -dha and 
35 "2 lr which bore Ala instead of wild-type lie at 

position 13 of Ub. Moreover, the enhancement of Ub 
reassembly by z 1 -z 1 interactions was observed even with 
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pairs of Ub fragments that could reconstitute Ub by 
themselves (in the absence of linked z,) . Specifically, 
whereas coexpression of C^-dha and N^-z, or N^-z, 
resulted in detectable but slow cleavage of C ub -dha that 
5 was still incomplete after 3 0 min of chase, coexpression 
of ZiC ub -dha and N^-z, or N^-z, resulted in the nearly 
complete cleavage of z^-dha (yielding dha) by the end 
of a 5-min pulse. 

The temporal resolution of this assay could be 
10 increased by shortening the labeling time from 5 to 2 
min. For example, the amounts of z^-dha cleaved by 
the end of a 2-min pulse progressively increased when 
z i c ub~ dha wa s coexpressed, respectively, with N^ 3 -z 
JMub -z 17 and -z v By contrast, no cleavage of z^-dha 
15 was observed when it was expressed by itself, or when 
JNub z, or -z, were coexpressed with C ub -dha, which 
lacked the z, zipper. 

To determine steady-state levels of ha-containing 
test proteins, whole cell extracts were fractionated by 
SDS-PAGE and analyzed by immunob lotting with anti-ha 
antibody. When C ub -dha was expressed in the absence of 
the N-terminal Ub fragment, the bulk of C^-dha remained 
uncleaved. when N^-z, was coexpressed with c K -dha a 

ub t a 

fraction of C ub -dha was cleaved to yield dha. However, 
when N* b 3 - Zl was coexpressed with z^-dha, virtually all 
of z i c ub~ dha was cleaved to yield dha. Similar results 
were obtained with N^-z,, but the "signal-to-noise- 
ratio was lower, in that a significant fraction of 
z i c ub~ dha remained uncleaved in the presence of N^ 3 -z 
whereas virtually all of z^-dha was cleaved in the'' 
presence of N^-z, . 

Thus, selecting appropriate Ub fragments, altering 
at least one of them to reduce the rate of Ub 
reconstitution by fragments alone, and linking these 
fragments to a pair of test polypeptides resulted in a 
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ubiquitin-based split-protein sensor, or USPS — a new type 
of in vivo assay for kinetic and equilibrium aspects of 
protein interactions. 

USPS differs qualitatively from a recent approach, 
the two-hybrid technique (Fields and Song, Nature 340: 
245 (1989); Chien et al., Proc . Natl. Acad. Sci . USA 88: 
9578 (1991); Guarante, Trends Genet. 90: 1639 (1993); 
Gyuris et al., Cell 75: 791 (1993)), which is based on 
expressing one protein as a fusion to a DNA-binding 
domain of a transcriptional activator, and expressing 
another protein as a fusion to a transcriptional 
activation domain. If the test proteins interact In 
vivo, a transcriptional activator is reconstituted, 
resulting in the induction of a reporter gene. 
Reconstitution of a transcriptional activator in the 
two-hybrid technique involves the generation of 
proximity between two conf ormationally independent 
protein domains whose functions do not depend on direct 
contacts between the domains. 

By contrast, USPS involves the spatial 
(conformational) reconstitution of a single-domain 
protein from its conf ormationally unstable subdomains, 
which acguire an assayable function as a result of their 
direct physical contact and stabilization of their 
conformations. Also in contrast to the USPS method, the 
two-hybrid technique cannot address temporal aspects of 
a protein-protein interaction. In addition, the two- 
hybrid technique limits the set of detectable protein 
interactions to those that occur (or can be 
"reproduced") in the nucleus, in proximity to the 
reporter gene. By contrast, the USPS method makes 
possible the detection and monitoring of a protein- 
protein interaction as a function of time, at the 
natural sites of this interaction in a living cell. 

USPS was demonstrated here with homodimerizing 
polypeptides of the leucine zipper type. In addition, 
USPS was used to detect and analyze in vivo interactions 
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between S. cerevisiae Sec62, an integral membrane 
protein, and the signal sequences of either the 
SXJC2 —encoded invertase or the MFal -encoded precursor of 
a-f actor, a mating pheroraone. The USPS assay detected 
specific, transient interactions between Sec62 and the 
above signal sequences; it also made possible a kinetic 
dissection of these interactions, which have previously 
been demonstrated in a cell-free system but not in vivo. 
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15 10 15 



GTT GAA TCT TCT GAC ACT ATT GAC AAT GTC AAG TCC AAG ATC CAA GAC 
Val Glu Ser Ser Asp Thr lie Asp Asn Val Lys Ser Lys lie Gin Asp 
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AAG GAA GGT ATT CCA CCT GAC CAA CAA AGA TTG ATC TTT GCT rrT i a a 

Lys Glu Gly He Pro Pro Asp Gin Gin Arg Leu lie HI 111 Tly 
" 40 45 

G^ tI?, ^ T o? T AGA ACT TTG TCC GAC TAC ATC CAA AAG GAA 192 

Gin Leu Glu Asp Gly Arg Thr Leu Ser Asp Tyr Asn He Gin Lys Glu 
DU 55 go 

TCT ACT CTA CAC TTG GTC TTG AGA TTG AGA GGT GGT 
Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly 
65 70 
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20 25 30 

Lys Glu Gly He Pro Pro Asp Gin Gin Arg Leu He Phe Ala Gly Lys 
Jtl 40 45 

Gin Leu Glu Asp Gly Arg Thr Leu Ser Asp Tyr Asn He Gin Lys Glu 
* U 55 60 

Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly 
65 70 - - 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
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Met Arg Ser Gly He Met 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Gly Glu He Ser Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Gly Glu He Ser Thr Leu Glu 
1 5 
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CLAIMS 



1) A fusion protein comprising an N-tenainal subdomain 
of ubiquitin, fused to a non-ubiquitin protein or 
peptide . 

2) The fusion protein of Claim 1 wherein the N- 
terminal subdomain of ubiquitin is mutationally 
altered to reduce its ability to associate with a 
C-terminal subdomain of ubiquitin, thereby 
reconstituting a quasi-native ubiquitin moiety, 
when contacted with a c-terminal subdomain of 
ubiquitin under conditions appropriate for 
protein/protein interaction. 



The fusion protein of claim 2 which is mutationally 
altered by replacing a first neutral amino acid 
residue with a second neutral' amino acid residue 
having a side chain which differs in size from the 
first neutral amino acid residue side chain. 

The fusion protein of claim 3, wherein the first 
amino acid residue is isoleucine. 



5) The fusion protein of Claim 4, wherein the 
isoleucine is either isoleucine 3, or isoleucine 13 
of wild-type ubiquitin. 

6) The fusion protein of Claim 5, wherein the 
isoleucine 3 or isoleucine 13 is replaced with an 
amino acid residue selected from the group 
consisting of glycine, alanine, valine or leucine. 
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7) A fusion protein comprising a C-terminal subdomain 
of ubiquitin, fused to the N-terminus of a non- 
ubiquitin protein or peptide, the fusion protein 
being cleavable by a ubiquitin-specif ic protease 
under conditions wherein the C-terminal subdomain 
of ubiquitin associates with an N-terminal 
subdomain of ubiquitin to form a quasi-native 
ubiquitin moiety, the products of the cleaved 
fusion protein being distinguishable from the 
uncleaved fusion protein. 

3) The fusion protein of Claim 7 further comprising an 
N-terminal protein or peptide extension of the 
ubiquitin subdomain. 

0 The fusion protein of Claim 7 wherein the non- 
ubiquitin protein or peptide contains, or is 
attached to an epitope recognized by an antibody, 
thereby facilitating immunological detection or 
isolation of the fusion protein or, following the 
cleavage of the fusion protein by a ubiquitin- 
specific protease, facilitates the immunological 
detection or isolation of the subdomain of the 
fusion protein located C-terminally relative to the 
C-terminal ubiquitin subdomain in the uncleaved 
fusion protein. 



10) 



The fusion protein of Claim 9 further comprising 
N-terminal protein or peptide extension of the 
ubiquitin subdomain. 
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11) The fusion protein of Claim 7 wherein the C- 
terminal subdomain of ubiquitin is mutationally 
altered to reduce its ability to associate with a 
N-terminal subdomain of ubiquitin, thereby 
reconstituting a quasi-native ubiquitin moiety, 
when contacted with a N-terminal subdomain of 
ubiquitin under conditions appropriate for 
protein/protein interaction . 

12) The fusion protein of Claim 11 further comprising 
an N-terminal protein or peptide extension of the 
C-terminal ubiquitin subdomain moiety. 

13) The fusion protein of Claim 11 wherein the non- 
ubiquitin protein or peptide contains, or is 
attached to an epitope recognized by an antibody, 
thereby facilitating immunological detection or 
isolation of the fusion protein or, following 
cleavage of the fusion protein by a ubiquitin- 
specific protease, facilitates the immunological 
detection or isolation of the subdomain of the 
fusion protein located C-terminally relative to the 
C-terminal ubiquitin subdomain in the uncleaved 
fusion protein. 

14) The fusion protein of Claim 13 further comprising 
an N-terminal protein or peptide extension of the 
C-terminal ubiquitin subdomain moiety. 

15) A DNA-based expression vector containing an 
expression cassette encoding a fusion protein 
comprising an N-terminal subdomain of ubiquitin, 
fused to a non-ubiquitin protein or peptide. 
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16) A DNA-based expression vector containing an 

expression cassette encoding a fusion protein 
comprising a C-terminal subdomain of ubiquitin, 
fused to the N-terminus of a non-ubiquitin protein 
or peptide, the fusion protein being cleavable by a 
ubiquitin-specific protease under conditions 
wherein the C-terminal subdomain of ubiquitin 
associates with an N-terminal subdomain of 
ubiquitin to form a quasi-native ubiquitin moiety, 
the products of the cleaved fusion protein being 
distinguishable from the uncleaved fusion protein. 

17) A DNA expression library comprising a DNA-based 
expression vector containing an expression cassette 
comprising DNA encoding an N-terminal subdomain of 
ubiquitin, fused to termini of DNA fragments from 
an organism of interest. 

18) The DNA expression library of claim 17 wherein the 
DNA encoding the N-terminal subdomain of ubiquitin 
is mutationally altered to encode an N-terminal 
subdomain of ubiquitin having reduced ability to 
associate with a C-terminal subdomain of ubiquitin 
to reconstitute a quasi-native ubiquitin moiety 
when contacted with a C-terminal subdomain of 
ubiquitin under conditions appropriate for 
protein/protein interaction. 

19) A DNA expression library comprising a DNA-based 

expression vector containing an expression cassette 
comprising DNA encoding a C-terminal subdomain of 
ubiquitin fused to termini of a DNA fragment of an 
organism of interest, the fusion protein encoded by 
the expression cassette comprising the C-terminal 
subdomain of ubiquitin fused to the N-terminus of a 
polypeptide encoded by the DNA fragment of the 
organism of interest, the fusion protein being 
cleavable by a ubiquitin-specific protease under 
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conditions wherein the C-terminal subdomain of 
ubiquitin associates with an N-terminal subdomain 
of ubiquitin to form a quasi-native ubiquitin 
moiety, the products of the cleaved fusion protein 
being distinguishable from the uncleaved fusion 
protein. 

20) The DNA expression library of Claim 19 wherein the 
DNA encoding the C-terminal subdomain of ubiquitin 
is mutationally altered to encode a C-terminal 
subdomain of ubiquitin having reduced ability to 
associate with an N-terminal subdomain of ubiquitin 
to reconstitute a quasi-native ubiquitin moiety 
when contacted with an N-terminal subdomain of 
ubiquitin under conditions appropriate for 
protein/protein interaction. 

21) A method for identifying interacting proteins or 
peptides , comprising : 

a) providing a first DNA-based expression vector 
containing an expression cassette encoding an 
N-terminal subdomain of ubiquitin fused to DNA 
encoding a first protein or peptide; and 
providing a second DNA-based expression vector 
containing an expression cassette encoding a 
C-terminal subdomain of ubiquitin fused to 
randomly generated DNA fragments produced by 
cleaving cDNA from an organism of interest, 
the encoded fusion protein being cleavable by 
a ubiquitin-specific protease under conditions 
wherein the C-terminal subdomain of ubiquitin 
associates with an N-terminal subdomain of 
ubiquitin to form a quasi-native ubiquitin 
moiety; either the first DNA sequence, the 
second DNA sequence, or both DNA sequences 
being mutationally altered to reduce the 
ability of the encoded N- and C-terminal 
subdomains of ubiquitin to associate to 
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reconstitute a quasi-native ubiquitin moiety 
when co-expressed in a cell; 

b) co-transforming a cell with the first DNA- 
based expression vector and the second DNA— 
based expression vector; and 

c) detecting cells in which the fusion protein 
encoded by the second DNA-based expression 
vector is cleaved by a ubiquitin-specif ic 
protease, the cleavage being indicative of 
interaction between the first protein or 
peptide and a second protein or peptide 
encoded by the randomly generated DNA from an 
organism of interest. 

) A method for identifying interacting proteins or 
peptides, comprising: 

a) providing a first DNA-based expression vector 
containing an expression cassette encoding an 
N-terminal subdomain of ubiquitin fused to a 
randomly generated DNA fragment from an 
organism of interest; providing a second DNA- 
based expression vector encoding a C-terminal 
subdomain of ubiquitin fused to DNA encoding a 
first protein or peptide, the encoded fusion 
protein being cleavable by a ubiquitin- 
specif ic protease under conditions wherein the 
C-terminal subdomain of ubiquitin associates 
with an N-terminal subdomain of ubiquitin to 
form a quasi-native ubiquitin moiety; either 
the first DNA sequence, the second DNA 
sequence, or both DNA sequences being 
mutationally altered to reduce the ability of 
the encoded N- and C-terminal subdomains to 
associate to reconstitute a quasi-native 
ubiquitin moiety when co-expressed in a cell; 
b) co-transforming a cell with the first DNA- 
based expression vector and the second DNA- 
based expression vector; and 
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c) detecting cells in which the fusion protein 
encoded by the second DNA-based expression 
vector is cleaved by a ubiquitin-specif ic 
protease, the cleavage being indicative of 
interaction between the first protein or 
peptide and a second protein or peptide 
encoded by the randomly generated DNA fragment 
from the organism of interest. 

23. A method for identifying interacting proteins or 
peptides, comprising: 

a) providing a first fusion protein comprising an 
N-terminal subdomain of ubiquitin fused to a 
first protein or peptide; providing a second 
fusion protein comprising a C-terminal 
subdomain of ubiquitin fused at its N-terminus 
to a second protein or peptide and fused at 
its C-terminus to a reporter protein which is 
inactive when N-terminal ly extended, the 
second fusion protein being cleavable by a 
ubiquitin-specif ic protease under conditions 
wherein the C-terminal subdomain of ubiquitin 
associates with an N-terminal subdomain of 
ubiquitin to form a quasi-native ubiquitin 
moiety; either the N-terminal subdomain of 
ubiquitin or the C-terminal subdomain of 
ubiquitin, or both being mutationally altered 
to reduce the ability of the N— and C-terminal 
subdomains to associate to reconstitute a 
quasi-native ubiquitin moiety when contacted 
under conditions appropriate for 
protein/protein interaction; 

b) fixing either the first fusion protein or the 
second fusion protein to a solid support 
thereby creating a fixed fusion protein; 

c) contacting the fixed fusion protein with a 
solution containing a fusion protein selected 
from the group consisting of the first fusion 
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protein or the second fusion protein under 
conditions appropriate for protein/protein 
interaction, the fusion protein in solution 
being the fusion protein which was not fixed 
to the solid support in step b) ; 
contacting a quasi-native ubiquitin complex 
formed in step c) with a ubiquitin-specif ic 
protease under conditions appropriate for 
protease activity; and 
e) detecting cleavage of the quasi-native 

ubiquitin complex by assaying for activity of 
the reporter protein which is active following 
cleavage which removes its N-terminal 
extension, the cleavage being indicative of 
protein interaction. 



24) A method for identifying interacting proteins or 
peptides, comprising: 

a) providing a first DNA-based expression vector 
containing an expression cassette containing a 
first DNA sequence encoding an N-terminal 
subdomain of ubiquitin fused to DNA encoding a 
first protein or peptide; and providing a 
second DNA-based expression vector containing 
an expression cassette containing a second DNA 
sequence encoding a C-terminal subdomain of 
ubiquitin fused at its N-terminus to randomly 
generated DNA fragments produced by cleaving 
DNA from an organism of interest, and fused at 
its C-terminus to a reporter protein which is 
inactive when N-terminal ly extended, the 
encoded fusion protein being cleavable by a 
ubiquitin-specific protease under conditions 
wherein the C-terminal subdomain of ubiquitin 
associates with an N-terminal subdomain of 
ubiquitin to form a quasi-native ubiquitin 
moiety; either the first DNA sequence, the 
second DNA sequence, or both DNA sequences 
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being mutationally altered to reduce the 
ability of the encoded N- and C-terminal 
subdomains of ubiquitin to associate to 
reconstitute a quasi-native ubiquitin; 

b) expressing the fusion proteins encoded by the 
first and second DNA-based expression vectors, 
either together or independently, in a cell 
which does not contain a ubiquitin-specif ic 
protease; 

c) purifying the fusion protein encoded by the 
first DNA-based expression vector; 

d) affixing the purified fusion protein to a 
solid support, thereby creating a fixed fusion 
protein; 

e) contacting the fixed fusion protein with a 
solution containing the fusion protein encoded 
by the second DNA-based expression vector, 
under conditions appropriate for 
protein/protein interaction ; 

f ) contacting a quasi-native ubiquitin complex 
formed in step e) with a ubiquitin-specif ic 
protease under conditions appropriate for 
protease activity; and 

g) detecting cleavage of the quasi-native 
ubiquitin complex by assaying for activity of 
the reporter protein which is active following 
cleavage which removes its N-terminal 
extension, the cleavage being indicative of 
protein interaction . 

25) A method for identifying interacting proteins or 
peptides , comprising : 

a) providing a first DNA-based expression vector 
containing an expression cassette containing a 
first DNA sequence encoding an N-terminal 
subdomain of ubiquitin fused to randomly 
generated DNA fragments produced by cleaving 
DNA from an organism of interest; and 
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providing a second DNA-based expression vector 
containing an expression cassette containing a 
second DNA sequence encoding a C-terminal 
subdomain of ubiquitin fused at its N-terminus 
to a second protein or peptide and fused at 
its C-terminus to a reporter protein which is 
inactive when N-terminally extended, the 
encoded fusion protein being cleavable by a 
ubiquitin-specif ic protease under conditions 
wherein the C-terminal subdomain of ubiquitin 
associates with an N-terminal subdomain of 
ubiquitin to form a quasi-native ubiquitin 
moiety; either the first DNA sequence, the 
second DNA sequence, or both DNA sequences 
being mutationally altered to reduce the 
ability of the encoded N- and C-terminal 
subdomains of ubiquitin to associate to 
reconstitute a quasi-native ubiquitin moiety; 

b) expressing the fusion proteins encoded by the 
first and second DNA-based expression vectors, 
either together or independently, in a cell 
which does not contain a ubiquitin-specif ic 
protease; 

c) purifying the fusion protein encoded by the 
second DNA-based expression vector; 

d) affixing the purified fusion protein from step 
c) to a solid support thereby creating a fixed 
fusion protein; 

e) contacting the fixed fusion protein with a 
solution containing the fusion protein encoded 
by the first DNA-based expression vector, 
under conditions appropriate for 
protein/protein interaction ; 

f) contacting a quasi-native ubiquitin complex 
formed in step e) with a ubiquitin-specif ic 
protease under conditions appropriate for 
protease activity; and 
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g) detecting cleavage of the quasi-native 

ubiguitin complex by assaying for activity of 
the reporter protein which is active following 
cleavage which removes its N-terminal 
extension, the cleavage being indicative of 
protein interaction. 

2 6) A method for determining a ligand in a sample to be 
tested for the ligand, comprising: 

a) providing a first fusion construct comprising 
an N-terminal subdomain of ubiguitin linked to 
an affinity reagent which binds to the ligand; 
providing a second fusion construct comprising 
a C-terminal subdomain of ubiguitin linked at 
its N-terminus to the ligand and at its C- 
terminus to a reporter which is inactive when 
linked to the C-terminal subdomain of 
ubiguitin, the second fusion construct being 
cleavable by a ubiquitin-specif ic protease 
under conditions wherein the C-terminal 
subdomain of ubiquitin associates with the N- 
terminal subdomain of ubiquitin to form a 
quasi-native ubiquitin moiety; either the N- 
terminal subdomain of ubiquitin or the c- 
terminal subdomain of ubiquitin, or both being 
mutationally altered to reduce the ability of 
the N- and C-terminal subdomains to associate 
to reconstitute a quasi-native ubiquitin 
moiety when contacted under conditions 
appropriate for protein/protein interaction; 

b) incubating predetermined" quantities of the 
first and second fusion constructs under 
conditions appropriate for binding of the 
ligand to the affinity reagent; 

c) contacting the incubation mixture of step b) 
with a ubiquitin-specif ic protease under 
conditions appropriate for protease activity; 
and 
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d) detecting cleavage of a quasi-native ubiquitin 
complex present in the incubation mixture of 
step b) by assaying for activity of the 
reporter which is active following cleavage by 
the ubiquitin-specif ic protease; 

e) repeating steps b)-d) with the addition of 
predetermined quantities of unbound ligand, 
which acts as a competitor with the ligand 
component of the first fusion construct, 
thereby effecting a decrease in the 
reconstitution of the quasi-native ubiquitin 
moiety and a subsequent decrease in the 
reporter activity; 

f ) incubating the sample to be tested for the 
ligand with the first and second fusion 
constructs under conditions which are 
otherwise identical to those of step b) ; 

g) contacting the incubation mixture of step f ) 
with a ubiquitin-specif ic protease under 
conditions appropriate for protease activity; 

h) detecting cleavage of a quasi-native ubiquitin 
complex present in the incubation mixture of 
step f) by assaying for activity of the 
reporter which is active following cleavage by 
the ubiquitin-specif ic protease; and 

i) comparing the level of reporter activity with 
the level of reporter activity determined in 
steps d) and e) to determine the ligand in the 
sample to be tested for the ligand. 

27) The method of Claim 2 6 wherein the affinity reagent 
is a protein. 

28) The method of Claim 2 7 wherein the protein is an 
antibody. 

29) A method for determining a ligand in a sample to be 
tested for the ligand, comprising: 
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a) providing a first fusion construct comprising 
an N-terminal subdomain of ubiquitin linked to 
the ligand; providing a second fusion 
construct comprising a C-terminal subdomain of 
ubiquitin linked at its N-terminus to an 
affinity reagent which binds to the ligand and 
at its C-terminus to a reporter which is 
inactive when linked to the C-terminal 
subdomain of ubiquitin, the second fusion 
construct being cleavable by a ubiquitin- 
specific protease under conditions wherein the 
C-terminal subdomain of ubiquitin associates 
with the N-terminal subdomain of ubiquitin to 
form a quasi-native ubiquitin moiety; either 
the N-terminal subdomain of ubiquitin or the 
C-terminal subdomain of ubiquitin, or both 
being mutationally altered to reduce the 
ability of the N- and C-terminal subdomains to 
associate to reconstitute a quasi-native 
ubiquitin moiety when contacted under 
conditions appropriate for protein/protein 
interaction; 

b) incubating predetermined quantities of the 
first and second fusion constructs under 
conditions appropriate for binding of the 
ligand to the affinity reagent; 

c) contacting the incubation mixture of step b) 
with a ubiquitin-specif ic protease under 
conditions appropriate for protease activity; 

d) detecting cleavage of a quasi-native ubiquitin 
complex present in the incubation mixture of 
step b) by assaying for activity of the 
reporter which is active following cleavage by 
the ubiquitin-specif ic protease; 
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e) repeating steps b)-d) with the addition of 
predetermined quantities of unbound ligand, 
which acts as a competitor with the ligand 
component of the first fusion construct, 
thereby effecting a decrease in the 
reconstitution of the quasi-native ubiquitin 
moiety and a subsequent decrease in the 
reporter activity; 

f) incubating the sample to be tested for the 
ligand with the first and second fusion 
constructs under conditions which are 
otherwise identical to those of step b) ; 

g) contacting the incubation mixture of step f) 
with a ubiquitin-specif ic protease under 
conditions appropriate for protease activity; 

h) detecting cleavage of a quasi-native ubiquitin 
complex present in the incubation mixture of 
step f) by assaying for activity of the 
reporter which is active following cleavage by 
the ubiquitin-specif ic protease; and 

i) comparing the level of reporter activity with 
the level of reporter activity determined in 
steps d) and e) to determine the ligand in the 
sample to be tested for the ligand. 

30) The method of Claim 29 wherein the affinity reagent 
is a protein. 

31) The method of Claim 30 wherein the protein is an 
antibody, 

32) A method for identifying an inhibitor of the 
binding of a ligand to an affinity reagent, 
comprising: 

a) providing a first fusion construct comprising 
an N-terminal subdomain of ubiquitin linked to 
an affinity reagent which binds to the ligand; 
providing a second fusion construct comprising 
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a C-terminal subdomain of ubiquitin linked at 
its N-terminus to the ligand and at its C- 
terminus to a reporter which is inactive when 
linked to the C-terminal subdomain of 
ubiquitin, the second fusion construct being 
cleavable by a ubiquitin-specif ic protease 
under conditions wherein the c-terminal 
subdomain of ubiguitin associates with an N- 
terminal subdomain of ubiquitin to form a 
quasi-native ubiquitin moiety; either the N- 
terminal subdomain of ubiquitin or the C- 
terminal subdomain of ubiquitin, or both being 
mutationally altered to reduce the ability of 
the N- and C-terminal subdomains to associate 
to reconstitute a quasi-native ubiquitin 
moiety when contacted under conditions 
appropriate for protein/protein interaction; 
incubating predetermined quantities of the 
first and second fusion constructs under 
conditions appropriate for binding of the 
ligand to the affinity reagent; 
contacting the incubation mixture of step b) 
with a ubiquitin-specif ic protease under 
conditions appropriate for protease activity; 
and 

detecting cleavage of a guasi-native ubiquitin 
complex present in the incubation mixture of 
step b) by assaying for activity of the 
reporter which is active following cleavage by 
the ubiquitin-specif ic protease; 
repeating steps b) -d) with the addition of 
compounds to be tested for the ability to 
interfere with the binding of the ligand to 
the affinity reagent, a decrease in the 
reporter activity resulting from the inclusion 
of the compound to be tested for the 
interfering ability, in an assay which 
otherwise identical to the assay of steps b) - 
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d) being indicative of the presence of a 
compound having the ability to interfere with 
the binding interaction of the ligand and the 
affinity reagent. 

33) The method of Claim 3 2 wherein the affinity reagent 
is a protein. 

34) The method of Claim 3 3 wherein the protein is an 
antibody. 

35) A method for identifying an inhibitor of the 
binding of a ligand to an affinity reagent, 
comprising: 

a) providing a first fusion construct comprising 
an N-terminal subdomain of ubiquitin linked to 
the ligand; providing a second fusion 
construct comprising a C-terminal subdomain of 
ubiquitin linked at its N-terminus to an 
affinity reagent which binds to the ligand and 
at its C-terminus to a reporter which is 
inactive when linked to the C-terminal 
subdomain of ubiquitin, the second fusion 
construct being cleavable by a ubiquitin- 
specific protease under conditions wherein the 
C-terminal subdomain of ubiquitin associates 
with an N-terminal subdomain of ubiquitin to 
form a quasi-native ubiquitin moiety; either 
the N-terminal subdomain of ubiquitin or the 
C-terminal subdomain of ubiquitin, or both 
being mutationally altered to reduce the 
ability of the N- and C-terminal subdomains to 
associate to reconstitute a quasi-native 
ubiquitin moiety when contacted under 
conditions appropriate for a protein/protein 
interaction; 
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c) 



36) 



b) incubating predetermined quantities of the 
first and second fusion constructs under 
conditions appropriate for binding of the 
ligand to the affinity reagent; 
contacting the incubation mixture of step b) 
with a ubiquitin-specific protease under 
conditions appropriate for protease activity; 
and 

detecting cleavage of a quasi-native ubiquitin 
complex present in the incubation mixture of 
step b) by assaying for activity of the 
reporter which is active following cleavage by 
the ubiquitin-specific protease; 
e) repeating steps b)-d) with the addition of 
compounds to be tested for the ability to 
interfere with the binding of the ligand to 
the affinity reagent, a decrease in the 
reporter activity resulting from the inclusion 
of the compound to be tested for the 
interfering ability, in an assay which is 
otherwise identical to the assay of steps b) - 
d) , being indicative of the present of a 
compound having the ability to interfere with 
the binding interaction of the ligand and the 
affinity reagent. 

The method of Claim 3 5 wherein the affinity reagent 
is a protein. 



37) The method of Claim 3 6 wherein the protein is an 
antibody. 
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38) A method for determining a ligand in a sample to be 
tested for the presence of the ligand, comprising: 
a) providing a first fusion construct comprising 
an N-terminal subdomain of ubiquitin linked to 
a first affinity reagent which specifically 
binds to first epitope of the ligand; 
providing a second fusion construct comprising 
a C-terminal subdomain of ubiquitin linked at 
its N-terminus to a second affinity reagent 
which specifically binds to a second epitope 
of the ligand and linked at its C-terminus via 
an amide bond to a reporter which is inactive 
when linked to the C-terminal subdomain of 
ubiquitin, the second fusion construct being 
cleavable by a ubiquitin-specif ic protease 
under conditions wherein the C-terminal 
subdomain of ubiquitin associates with an N- 
terminal subdomain of ubiquitin to form a 
quasi-native ubiquitin moiety; either the N- 
terminal subdomain of ubiquitin or the C- 
terminal subdomain of ubiquitin, or both being 
mutationally altered to reduce the ability of 
the N— and C-terminal subdomains to associate 
to reconstitute a quasi-native ubiquitin 
moiety when contacted under conditions 
appropriate for protein/protein interaction; 

b) incubating predetermined quantities of the 
first and second fusion constructs under 
conditions appropriate for protein/protein 
interaction; 

c) contacting the incubation mixture of step b) 
with a ubiquitin-specif ic protease under 
conditions appropriate for protease activity; 
and 
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40) 



d) 



e) 



detecting cleavage of a quasi-native ubiquitin 
complex present in the incubation mixture of 
step b) by assaying for activity of the 
reporter which is active following cleavage by 
the ubiquitin-specific protease; 
repeating steps b)-d) with the addition of 
predetermined quantities of unbound ligand 
which functions to link the first fusion 
construct to the second fusion construct by 
virtue of its binding interaction with the 
first and second affinity reagents, thereby 
effecting an increase in the reconstitution of 
the quasi-native ubiquitin moiety and an 
increase in the reporter activity; 
incubating the sample to be tested for the 
ligand with the first and second fusion 
constructs under conditions which are 
otherwise identical to those of step b) ; 
contacting the incubation mixture of step f) 
with a ubiquitin-specific protease under 
conditions appropriate for protease activity; 
detecting cleavage of a quasi-native ubiquitin 
complex present in the incubation mixture of 
step f ) by assaying for activity of the 
reporter which is active following cleavage by 
the ubiquitin-specific protease; and 
i) comparing the level of reporter activity with 
the level of reporter activity determined in 
steps d) and e) to determine the ligand in the 
sample to be tested for the ligand. 

39) The method of Claim 3 8 wherein the first and second 
affinity reagents are proteins which bind to non- 
overlapping epitopes of the ligand. 



h) 



The method of Claim 3 9 wherein the first and second 
affinity reagents are antibodies which bind to non- 
overlapping epitopes of the ligand. 
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ll) A fusion protein comprising an N-terminal subdomain 
of ubiquitin, or a C-terminal subdomain of 
ubiquitin, linked to an affinity reagent. 

2) The fusion protein of Claim 41 wherein the affinity 
reagent is a protein. 

3) The fusion protein of Claim 42 wherein the protein 
is an antibody. 

4) A fusion protein comprising an N-terminal subdomain 
of ubiquitin, or a C-terminal subdomain of 
ubiquitin, linked to a ligand. 



WO 95/29195 



PCT/US95/04536 



1/5 




SUBSTITUTE SHEET (RULE 26) 



WO 95/29195 



PCT/US95/04536 



2/5 



C 




N 



FIGURE 2. 



WO 95/29195 



PCT/US95/04536 



3/5 



76 



I I Ub-DHFR 
« Ub-DHFR 
I Ub AT3 -DHFR 
'V / Ub G "-DHFR 
V I Ufa^HFH 





SteS 


VI 


1 Lfb'" 68 -DHFR 


VII 


Ub -DHFR 


VIII 


UD -DHFR 


IX 


-DHFR 



234 262 35 
MGG" 




x I c; 

37 235 



wt 



FJSTLE 




XI 


1 fC-z, 


x„ 




XIII 




XIV 1 


N ub -z, 



235 ■ 281 

J 234 262 35 

S!e6 



XV 




wt 

Z '-C ub -DHFR 



FIGURE 3. 



WO 95/29195 



4/5 



PCT/OS95/04536 




Ub-Specific 
protease 
^ cleavage 



Figure 4C j 



Figure 4D 



SUBSTITUTE SHEET (RULE 26) 



WO 95/29195 



PCT/US95/04536 



5 / 5 




SUBSTITUTE SHEET (RULE 26) 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US95/04536 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC (6) :C07K 19/00; C12N 15/62, 15/63; C12Q 1/68; GOIN 33/53, 33/566 
.US CL : Please See Extra Sheet. 
According to International Patent Classification (IPC) or to both national classification and TPC 

B, FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 530/350, 403; 435/6, 7.72, 7.8, 7.9, 7.93, 7.94, 172.3, 320.1, 69.7; 536/23.4 

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
Please See Extra Sheet. 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


A 


US, A, 5,283,173 (FIELDS ET AL.) 01 February 1994 


1-44 


A 


PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCE, 
Volume 90, issued March 1993, L, Guarente, "Strategies for 
the Identification of Interacting Proteins", pages 1 639-1 641 . 


21-40 


A 


JOURNAL OF MOLECULAR BIOLOGY, Volume 194, issued 
1987, S, Vijay-Kumar et al., "Structure of Ubiquitin Refined 
at 1,8 AResolution", pages 531-544. 


1-44 



j x| Further documents are listed in the continuation of Box C. j j See patent family annex. 



Special categories of cited documents: 

document defining the general ataie of the art which is not considered 
to be of particular relevance 

earlier document published on or after the international filing dale 

document which may throw doubts on priority claim(s) or which is 
cited to establish the publication date of another citation or other 
special reason (a* specified) 



document referring to an oral disclosure, i 



exhibition or other 



doct 



1 published prior to the international filing date but later *h n n 



the priority date claimed 



"T" later document published after the international filing date or priority 

date and not in conflict with the application but cited to understand the 
principle or theory underlying the invention 

"X' document of particular relevance; the claimed invention cannot be 

considered novel or cannot be considered to involve an inventive itep 
when the document is taken alone 

"Y" document of particular relevance; the claimed invention cannot be 

considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 

'&.* document member of the same patent family 



Date of the actual completion of the international search 
13 JULY 1995 


Date of mailing of the international search report 

03AUG1995 


Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
. Box PCT 

Washington, D.C. 20231 
Facsimile No. (703) 305-3230 


Authorized officer ; •? ; / t ->0 

REBECCA PROUTY 
Telephone No. (703) 308-0196 



Form PCT/ISA/210 (second sheet)(July 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US95/04536 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages Relevant to claim 



No 



JOURNAL OF BIOMOLECULAR NMR, Volume 3, issued 1993, 
BJ. Stockman et aL, "Heteronuclear Three-Dimensional NMR 
Spectroscopy of a Partially Denatured Protein: The A-State of 
Human Ubiquitin", pages 285-296. 

JOURNAL OF MOLECULAR BIOLOGY, Volume 234, issued 
1993, J.P.L. Cox et ah, "Dissecting the Structure of a Partially 
Folded Protein, Circular Dichoiism and Nuclear Magnetic 
Resonance Studies of Peptides From Ubiquitin", pages 483-492. 



1-44 



1-44 



Form PCT/ISA/210 (continuation of second sheet)(Juiy 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US95/04536 



A. CLASSIFICATION OF SUBJECT MATTER" 
US CL : 



530/350, 403; 435^ 7.72, 7.8, 7.9, 7.93, 7.94, 172.3, 320.1, 69.7; 536/23.4 
B. FIELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 
APS, MEDLINE, BIOSIS, EMBASE, LIFESCI, CA, BIOTECH DS WPI 

^^J^SSTSS^ or determin? or mah0 ' m - ubkluilin - fragmem * ° r ^ - d — 



Form PCT/ISA/210 (extra shect)(JuIy 1992)* 



